Picture for Bowen Zhou

Bowen Zhou

CAF-Mamba: Mamba-Based Cross-Modal Adaptive Attention Fusion for Multimodal Depression Detection

Add code
Jan 29, 2026
Viaarxiv icon

HIPPO: Accelerating Video Large Language Models Inference via Holistic-aware Parallel Speculative Decoding

Add code
Jan 13, 2026
Viaarxiv icon

I2E: From Image Pixels to Actionable Interactive Environments for Text-Guided Image Editing

Add code
Jan 07, 2026
Viaarxiv icon

Effective Online 3D Bin Packing with Lookahead Parcels Using Monte Carlo Tree Search

Add code
Jan 06, 2026
Viaarxiv icon

InternVLA-A1: Unifying Understanding, Generation and Action for Robotic Manipulation

Add code
Jan 05, 2026
Viaarxiv icon

SCP: Accelerating Discovery with a Global Web of Autonomous Scientific Agents

Add code
Dec 30, 2025
Viaarxiv icon

Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation

Add code
Dec 22, 2025
Viaarxiv icon

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Add code
Dec 18, 2025
Viaarxiv icon

SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding

Add code
Dec 16, 2025
Figure 1 for SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding
Figure 2 for SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding
Figure 3 for SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding
Figure 4 for SDAR-VL: Stable and Efficient Block-wise Diffusion for Vision-Language Understanding
Viaarxiv icon

Accurate de novo sequencing of the modified proteome with OmniNovo

Add code
Dec 13, 2025
Viaarxiv icon